Detection of coughs from user utterances using imitated phoneme model

نویسندگان

Shinya Takahashi

Tsuyoshi Morimoto

Sakashi Maeda

Naoyuki Tsuruta

چکیده

This paper proposes imitated phoneme models that represent non-verbal sounds, especially cough sounds here, as phoneme sequences. The purpose of this research is to detect the cough sounds from user utterances accurately for the home health care task because coughing is one of the most important barometers to check the health condition. To deal with the variety of the cough sounds, the imitated phoneme models are constructed by clustering of phoneme sequences obtained in phoneme recognition. The experimental results show that this approach can improve the correct rates and the accuracies for words and coughs compared with the approach using HMM constructed from cough waveforms.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic model construction for a user oriented task using speech utterances

We propose a method to self-organize a user model for realizing a text search system. The method self-organizes a model from phoneme sequences and their texts using 6 basic rules. An inference method associated with the self-organized model searches texts corresponding to inputted phoneme sequences. Experiments using 50 voice waveforms of 10 speakers show that the self-organized model is useful...

متن کامل

Identifying Function-specific Prosodic Cues for Non-speech User Interface Sound Design

This study explores the potential of utilising certain prosodic qualities of function-specific vocal expressions in order to design effective non-speech user interface sounds. In an empirical setting, utterances with four context-situated communicative functions were produced by 20 participants. Time series of fundamental frequency (F0) and intensity were extracted from the utterances and analy...

متن کامل

Detecting user speech in barge-in over prompts using speaker identification methods

In this paper, we investigate the use of a speaker identi cation technique to solve the barge-in speech detection problem. This scenario is a very simple application of speaker identi cation since only two users are involved. This is further simpli ed by the fact that the prompt speaker can be modelled apriori. Additionally, the user can be modelled as well improving the performance of the syst...

متن کامل

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...

متن کامل

Transfer learning for cross-lingual automatic speech recognition

In this study, an instance based transfer learning phoneme modeling approach is presented to mitigate the effects of limited data in a target language using data from richly resourced source languages. A maximum likelihood (ML) learning criterion is introduced to learn the model parameters of a given phoneme class using data from both the target and source languages. Each phoneme was modeled us...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2005

Detection of coughs from user utterances using imitated phoneme model

نویسندگان

چکیده

منابع مشابه

Automatic model construction for a user oriented task using speech utterances

Identifying Function-specific Prosodic Cues for Non-speech User Interface Sound Design

Detecting user speech in barge-in over prompts using speaker identification methods

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Transfer learning for cross-lingual automatic speech recognition

عنوان ژورنال:

اشتراک گذاری